Perceptual Coding of Audio Using Signal-Adaptive Filterbanks
Authors
Abstract
This thesis studies the application of signal-adaptive filter banks to perceptual audio coding, with an emphasis on Wavelet Filter Banks (WFB). It gives an overview of perceptual audio coding, the psychoacoustic principles that motivate it, transforms, and wavelet theory. It also presents several existing wavelet-based audio coding schemes, most of which aim to overcome the poor frequency localization of the wavelet transform. Finally, a simple MLT-based coder and a signal-adaptive wavelet-based coder are implemented and tested. The proposed wavelet coder employs a hybrid filter bank: the first stage uses a Discrete Wavelet Transform (DWT), and the second stage uses one or more Discrete Cosine Transforms (DCT) of variable length, with the coding gain serving as the switching criterion. Two adaptive bit allocation methods are available. One uses a theoretically derived relation that assumes a uniformly distributed quantization error; the other uses an operational rate-distortion measure. The samples are quantized with a linear scalar quantizer, and the rate is estimated via entropy. The coder is tested with different wavelet filter sets and different DCT lengths. The adaptive DCT stage yields a clear improvement, especially for harmonic signals such as piano music, which have been shown to be problematic for wavelet coders.
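The quantities named in the abstract (coding gain as a switching criterion, a linear scalar quantizer, an entropy-based rate estimate, and the uniform-error assumption) can be illustrated with a minimal sketch. This is not the thesis's implementation. The function names, the textbook definition of coding gain as the ratio of arithmetic to geometric mean of coefficient variances, the zero-mean variance estimate, and the rule "pick the candidate DCT length with the highest gain" are all assumptions made here for illustration.

```python
import math
from collections import Counter


def coding_gain(band_variances):
    """Arithmetic mean over geometric mean of the variances (assumed definition)."""
    n = len(band_variances)
    arithmetic = sum(band_variances) / n
    geometric = math.exp(sum(math.log(v) for v in band_variances) / n)
    return arithmetic / geometric


def dct2(block):
    """Orthonormal DCT-II of one block, written out directly (O(N^2))."""
    n = len(block)
    out = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        out.append(scale * sum(x * math.cos(math.pi * (i + 0.5) * k / n)
                               for i, x in enumerate(block)))
    return out


def pick_dct_length(samples, candidate_lengths, floor=1e-12):
    """Hypothetical switching rule: return the length with the highest coding gain."""
    best_length, best_gain = None, -1.0
    for n in candidate_lengths:
        # Split the subband signal into non-overlapping blocks of length n.
        blocks = [dct2(samples[i:i + n])
                  for i in range(0, len(samples) - n + 1, n)]
        if not blocks:
            continue
        # Per-coefficient variance, assuming zero mean; floored to keep log() finite.
        variances = [max(sum(b[k] ** 2 for b in blocks) / len(blocks), floor)
                     for k in range(n)]
        gain = coding_gain(variances)
        if gain > best_gain:
            best_length, best_gain = n, gain
    return best_length


def quantize(samples, step):
    """Linear (uniform) scalar quantizer: map each sample to an integer index."""
    return [round(x / step) for x in samples]


def entropy_bits_per_sample(indices):
    """First-order entropy of the quantizer indices, used as a rate estimate."""
    counts = Counter(indices)
    total = len(indices)
    return -sum(c / total * math.log2(c / total) for c in counts.values())


def uniform_noise_power(step):
    """Quantization error power if the error is uniform over one step."""
    return step * step / 12.0
```

A coder along these lines might call `pick_dct_length` once per frame and subband, quantize the resulting coefficients with `quantize`, and compare `entropy_bits_per_sample` against `uniform_noise_power` when allocating bits. How the thesis actually combines these steps is not stated in the abstract.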
Similar resources
Blind Tamper Detection in Audio using Chirp based Robust Watermarking
In this paper, we propose the use of ‘chirp coding’ for embedding a watermark in audio data without generating any perceptual degradation of audio quality. A binary sequence (the watermark) is derived using energy-based features from the audio signal, and chirp coding is used to embed the watermark in audio data. The chirp coding technique is such that the same watermark can be derived from the ori...
Very Low Bit Rate Audio Coding Development
Audio coding at very low bit rates requires very efficient evaluation of characteristics of the signal sources and of perceptual properties of the human ear. An overview of models for the signal source employed in different coding schemes is given. Some of the basic concepts for perceptual quantization and coding of model parameters are shown, including approaches for scalability with respect ...
PEAQ based Psychoacoustic Model Implementation using Wavelet Packet Decomposition
Audio compression is the lossy compression technique of converting audio signal into an efficiently encoded bitstream that can be decoded to produce a close approximation of the original signal. For the purpose of improving the coding this work attempts to combine psychoacoustic model for perceptual evaluation of audio quality in BS.1387 with perceptual audio coder. The implementation of this n...
Audio Coding Using Perceptually Controlled Bitstream Buffering
Perceptual audio coders use a varying number of bits to encode subsequent frames according to the perceptual entropy of the audio signal. For transmission over a constant bitrate channel the bitstream must be buffered. The buffer must be large enough to absorb variations in the bitrate, otherwise the quality of the audio will be compromised. We present a new scheme for buffer control of percept...
Tree and filter optimization for audio compression in a wavelet-based perceptual audio coder
This paper outlines a new perceptual low bit rate audio coding scheme based on adapted wavelet representations. It claims wavelet tree and filter adaptation attending to a perceptual entropy-based method. To achieve such adaptive structure, a periodized wavelet packet transform is performed for each audio frame. After the transform, the encoder employs scalar adaptive quantization, controlled b...
Publication date: 2006